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Abstract 

Obtaining a good momentum reconstruction of a jet is a compromise between tak- 
ing it large enough to catch the perturbative final-state radiation and small enough to 
avoid too much contamination from the underlying event and initial-state radiation. 
In this paper, we compute analytically the optimal jet radius for dijet reconstruc- 
tions and study its scale dependence. We also compare our results with previous 
Monte-Carlo studies. 
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1 Introduction 

With the start of the LHC as a main source of motivation, jet physics has seen a tremendous 
development over the last couple of years. A significant amount of effort has been placed 
in trying to define jets in an optimal way for new physics searches. Just to mention a few 
examples, this includes the introduction of new jet algorithms [11 |2], new techniques to 
clean the Underlying Event (UE) contribution to the jets |3l HI El E] , or a large series of 
sub-jet techniques aimed at tagging boosted objects [21 [5l [71 [H [9l [TOl [TTl [12] . 

An aspect that has been slightly less investigated is the values that one should use for 
the parameters inherent to a jet definition. A noticeable example is the case of the "radius" 
of a jet, often denoted by R, common to all the standard jet definitions used in hadronic 
collisions. It has been noticed [4j that if one wants to optimise the kinematic reconstruction 
of dijet events at the LHC, it is mandatory to adapt the radius of the jet when varying the 
scale of the process and the type of partons involved in the reconstruction. 

With the appearance of new techniques, new parameters are added and one might 
expect the corresponding multi-dimensional optimisation to become more and more in- 
volved. There is therefore a risk that a poor choice of parameters for the definition of jets 
counteracts the positive effect of these new refinements. 
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In this paper we thus want to take an alternative, complementary, approach and, instead 
of adding a new procedure with its own parameters, study how it is possible to fix one 
parameter in an optimal way given the properties of the events we want to reconstruct. 
In other words, we shall remove one free parameter by determining it analytically from 
the scales in the process we shall look at. Practically, we shall focus on the determination 
of the most natural parameter, the radius R, for the reconstruction of a massive colour- 
neutral object decaying into two jets, as done within a Monte Carlo approach in [^|. In 
that case, we want to choose R large enough to catch the perturbative QCD radiation, but 
not too large to avoid an excessive contamination from the UE. Our approach thus goes 
along the same fine as [I3], except that we shall extend the discussion to include extra 
features requested by the fact that we are looking at a peaked distribution and wish to 
obtain a more precise determination of R. Contrarily to the case of [H] , where an attempt 
was made to adapt dynamically the size of the jet with its hardness by replacing the size 
parameter with a dimensionful scale varying with the process under consideration, we shall 
determine a unique value for R from the hard scale of the event and the scale of the UE, 
i.e. not introducing any alternative parametei0. 

The paper is organised as follow: in Section [2] we describe the process we will study as 
well as the details on how we proceed with the event analysis. We also describe in that 
Section our strategy for the analytic computation of the optimal radius -Rbest- We then 
proceed with the computation of -Rbest itself. Section |3] concentrates on the situation at 
the partonic level, where only perturbative radiation has to be taken into account; while in 
Section H] we add to the picture the contamination due to the UE. In both cases, we shall 
compare our results with what we obtain from Monte-Carlo studies, as in Finally, in 
Section |5l we shall discuss the extraction of the optimal jet radius for situations where we 
perform an underlying event background subtraction. 



2 Dijet decay of a massive object 

As in [4J, we shall study the hard processes qq ^ Z' ^ qq and gg ^ H ^ gg, where the 
(fictitious) Z' and H are made very narrow. The main advantage of such simple processes 
is that one can easily study the definition of jets at a given scale, by varying the mass M 
of the colourless resonance. It also allows one to study the differences between quark and 
gluon jets, and to test the validity of our computations with Monte-Carlo simulations. 

The procedure for the event analysis is straightforward: we cluster the events with a 
given jet definition, select the two highest-pt jets and reconstruct the heavy object from 
them. In Monte-Carlo studies, the events were generated either with Pythia [15] (tune 
DWT) or with Herwi^ [16] with the default Jimmy tune for the UE. We have required 
that the jets satisfy pt > 10 GeV and \y\ < 5, and we have further imposed that the two 

^In the case of [14j . this would mean predicting the scale parameter as a function of known properties 
of the events. 

^In the case of Herwig, we have actually generated qq — >■ Graviton — > qq and gg — > Graviton — > gg 
events, which does not make any difference in the discussions throughout this paper. 
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hardest jets, the ones used to reconstruct the heavy object, are close enough in rapidity, 
I Ay I < 1 (so that the transverse momentum of the jets remains close to M/2). 

The quantification of the performance of a given jet definition is also borrowed from |4]: 
we define QJ=2 as the width of the smallest mass window that contains a fraction f = z 
of the reconstructed object^. With that definition, a smaller QJ=z would correspond to a 
narrower peak and thus a better reconstruction quality. Finding the optimal R, Rhest, is 
therefore equivalent to finding the minimum of Q^=zi seen as a function of R. 

Our task in this paper is to perform an analytic computation of the spectrum of the 
reconstructed mass peak dP/dmrec, or dP/d5m with 5m = mj-ec—M, the difference between 
the reconstructed and the nominal mass. By integrating that spectrum, we can compute 
the probability P{Smi, 61712) for the difference Sm to be between 6mi and 6m2. If we write 
the quality measure as QJ=z — I2 ~ Qi, where qi and q2 are the lower and upper end of 
the mass window defining Q^=zi ^ straightforward exercise to show that it can be 
computed from the analytic spectrum by solving 



dP 



d5m 



dP 



and P(gi,g2) = z. (1) 

5m=q2 



Repeating this for different values of P, one can then compute Pbest, the value of R for 
which the quality measure is minimal. We shall use z = 0.25. 

As far as the analytic computation itself is concerned, there are three physical contribu- 
tions that will affect the width of the reconstructed peak: final-state radiation, initial-state 
radiatioE0 and the UE. The first two are perturbative; while final-state radiation leads to 
losses by emissions out of the jet, initial-state radiation adds extra, unwanted, radiation 
inside the jet. Already at the purely partonic level, there is thus an optimal radius. At 
this stage, the mass spectrum will depend on the mass of the reconstructed object, M, and 
whether it decays into quarks or gluons since, perturbatively, gluons radiate more than 
quarks. 

The third contamination, the underlying event, is softer. Like the initial-state radiation, 
it tends to move the reconstructed mass towards larger values by clustering soft particles 
into the jet. The amount of contamination involves another scale: the density of UE (per 
unit area) p. The most important source of dispersion in the mass spectrum comes from 
the fact that p varies from an event to another. 

In the next two Sections, we first concentrate on the purely perturbative behaviour, 
then include the effect of the UE. But before getting our hands dirty, there is one additional 
point that needs to be discussed: the fact that we should also expect some effect due to 
hadronisation. Actually, for the purposes of this paper, hadronisation plays a limited role. 
In theory p^J, it leads to a loss that increases like 1/P at small P, to be compared with a 



•^In [4], QJ=z defined as a fraction of the generated objects rather than a fraction of the reconstructed 
ones. Though the former is practically more reliable, choosing the later does not affect i?bcst in practice 
and simplifies considerably the analytic computation. 

^Since the produced massive object is colourless, there is no interferences between gluons emitted from 
the initial-state and final-state partons. 
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log(l/-R) for the perturbative component, and so should dominate the small- i? behaviour. 
However, the normalisation is such that for practical values of R the perturbative contri- 
bution dominates, and the effect of hadronisation on the value of -Rbest remains negligible. 

A final comment concerns the jet algorithm that we shall consider: we will mostly focus 
on the aiiii-kt algorithm [2j as its behaviour is simpler. In Section we shall discuss the 
case of the Cambridge/ Aachen [T7j and Cambridge/Aachen with filtering [3lll] algorithms. 

3 Perturbative QCD spectrum 

The part of the spectrum that is the most naturally computed is the pure perturbative 
QCD contribution. This is equivalent to concentrating on partonic events. In that case, 
two types of contribution can affect the reconstruction of the jet momentum: final-state 
radiation outside of the jet which leads to an underestimation of the jet momentum, and 
in-jet initial-state radiation yielding an over-estimation of the momentum. 

In this Section, we shall thus compute the pure partonic spectrum. We do this at the 
one-gluon-emission (OGE) level. Since we are interested in the behaviour of the spectrum 
close to the mass peak, we are dominated by the 1/z singularity directly coming from the 
soft divergence of QCD. In that region, to get an integrable spectrum around the mass peak, 
it is also important to resum the (a^ log(l/z))'^ contributions to all orders, which we shall 
do assuming a Sudakov-like exponentiation, neglecting non-global logarithms [T^ [20]. 

For the clarity of the computation, we first deal with the soft approximation, where we 
only keep the 1/z part of the gluon emission. We then introduce the sub- leading corrections 
coming from the PDF in the initial state which can have non-negligible effects, especially 
in the case of gluon resonances at large mass. 

3.1 Soft-gluon emission approximation 

Because of longitudinal boost invariance, we can assume that the heavy object is produced 
at y = 0. Furthermore, because of our cut |A?/| < 1 on the dijets, we shall assume that the 
decay products are at the same rapidity. The correction due to the finiteness of Ay can 
be computed but has very little effect (at most a few percent). Since the calculation for a 
nonzero Ay cut becomes rather technical, we shall keep things simple and assume Ay = 0. 
At the lowest order of perturbation theory, the incoming (pi,2) and outgoing (A;i,2) partons 
thus have the following momenta 

Pi 
Pt 

K 

^2 



= y (0,0,1,1) 
M 

^-(0,0,-1,1) 

M , 

= Y (1,0, 0,1) 

M , 

(-1,0,0,1) 
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We shall consider the emission of an extra soft gluon: 



k'^ = (cos(0), sin(0), sinh(?/), cosh(?/)) 

with z 1. The probability for such an emission to happen is given by the antenna 
formula 

for an emission from the initial state, and a similar expression with pi replaced by ki for 
final-state radiation. The pre-factor Cr is a colour factor which should be equal to for 
an emission from a quark line and Ca for an emission from a gluon line. 

Note that in the case of a single soft emission, the gluon is caught if and only if it is 
at a geometric distance smaller than R from one of the two "leading" partons in the final 
stat^. 

If we denote by 5m the difference, mrec — M, between the reconstructed mass and the 
nominal one, initial-state radiation will lead to an overestimation of the jet mass with 
Sm = zM/2cosh.{y) whenever the additional gluon is radiated inside the jet, while final- 
state radiation outside the jet will lead to an underestimation of the mass with 6m = 
-zM/2 cosh(?/). 

Integrating over the rapidity and azimuth of the emitted gluon and using ([2]), we find 

-7^ = 2/ d0d?/ — = —Ai[R)— (3) 

ddm ymin(<A2,(7r-0)2)+?;2<i?2 ^^m TT^ dm 

for the initial-state radiation spectrum, and 



d5"^ 7min(</,2,(7r-0)2)+j;2>R2 1 5m| COsh (?/) - COS^ (0) 

for final-state radiation, with 

Ai{R) = 7tR^, 



Af{R) = 27rlog{2/R)-^R' + 0{R' 



(5) 



capturing the geometric factors and the R dependence. One recognises the typical expected 
behaviour: the contamination due to initial-state radiation is proportional to the jet area, 
i.e. to vri?^, while the logarithmic behaviour at small R of the final-state radiation is the 
trace of the coUinear divergence in QCD. In both cases, the 1/Sm behaviour corresponds 
to the soft divergence of QCD. 



^This would be true for all the recombination-type algorithms, like kt [H] or Cambridge/ Aachen, with 
or without filtering, as well as for the SISCone T algorithm (at least in the z ^ 1 limit). 
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Note that the coefficient of the final-state radiation Af{R) has been expanded in series 
of R. The logarithmic behaviour at small R corresponds to the leading coUinear term 
as computed in [13] but we have kept enough terms in the expansion to have a correct 
description over the whole R range. In particular, the numerator in the argument of the 
logarithm will play a mandatory role to allow the best radius to go above 1, which is 
the case in the purely perturbative spectrum and, more generally, for large-mass gluon 
resonances. 

It is also important to discuss the choice of scale fi for a^. When using equation ([2]), 
the argument of the coupling is, for initial and final-state radiation respectively, 

2 _ {pi.k){p2.k) Z^M^ 
/Xj — l- 



iPl-P2) 4 

^ (kX) ^ ^[cosh2(y)-cos2(0)]. 

Assuming that emissions close to the edge of the jet are dominant, we will use the following 
simplified approximation: 

Sm RSm ,_, 

cosh(_K) cosh(it) 

As we are interested in the behaviour of the mass spectrum in the vicinity of the 
peak, the last thing we need to consider is the virtual corrections and the higher-order 
terms that diverge as a" log"(l/2;) in that region. In principle, this would depend on 
the jet algorithm under consideration and an exact computation would involve non-global 
logarithms |19l [20]. In practice, we shall adopt a simpler approach and assume that 
the one-gluon-emission result exponentiates. This is equivalent to the introduction of a 
Sudakov factor 

Sij = exp ( — / du ^'■^ 



5m\ d6m 



(8) 

where the integration is performed up to 5mmax = M/2, corresponding to z = 1 at y = 0. 
This is an arbitrary choice though a different one would only introduce corrections beyond 
the level of precision we are working at. Using equations (IH]), (jl]), (I?!) and (IE]) we find 



ddm \6m\ log(M/(2Aij)) 



log(|^m|/A,;) l^-^-^ 

log(M/(2A,;))J ' ^ ^ 



with 



K,j = ^A,j{R), A, = cosh(i?)AQCD and A; = ^^^^Aqcd. (10) 

We note that as long as Ki j is smaller than 1, the spectrum goes to when 5m tends 
to Ajj. Alternatively, we could regularise the Landau pole in the expression for but we 
have noticed no significant effect in our conclusions when doing so. 
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Figure 1: Coefficients 2a aC RAij{R) / tt'^ , governing the behaviour of the initial or final-state 
QCD spectrum as a function of R for different masses. The left plot covers the case of 
quark jets for which we use Cr = Cp = 4/3, and the right plot is for gluon jets for which 
Cr = Ca = 3. 



Together with the spectrum, we will also need the total probability for the reconstructed 
mass to be within a certain range, i.e. the integrated spectrum. Denoting by P{6mi, 5m2) 
the probability for mrec — M to be between 5mi and 5m2, we easily find 



P(Aij, 5m) 



\og{\5m\/K,j) Y 
\og{M/{2Kj))_ 



corresponding to the spectrum ([9]). Note that in this computation, we have assume that 
the spectrum is cut at the Landau pole, corresponding to \5m\ = Aij. 

Before proceeding with a more involved computation, it is important to comment on 
the limit of validity of this perturbative computation. From equations ([3]), (jl]) and (E]), one 
might expect higher-order corrections to be important when 2asCFtAij{R)/n'^ becomes of 
order 1. Fixing the scale in the coupling to M/2 for the sake of the argument, we have 
plotted that quantity on Fig. [1] for various cases of interest. In the case of quark-jets, 
both Ki and Kf remain smaller than 1. The case of gluon jets is more interesting; we see 
that the coefficient for final-state radiation goes above one at small R. In the case of a 
small-mass resonance, this can happen for values of R as large as 0.5. As a rule of thumb, 
we can thus say that, in the case of a small-mass resonance decaying to gluon jets, our 
perturbative computation would only be valid for R > 0.5. It is particularly interesting to 
relate that to the observation made in [3] that in this precise case of small-mass resonances 
decaying into gluons, we observed a disappearance of the peak in the reconstructed mass 
spectrum for small values of R. Though this was happening at slightly smaller values of 
R (around R ~ 0.4), the present computation provides an explanation of this effect: when 
2aaC RAij{R) / Ti"^ grows above 1, the peak in the final-state radiation spectrum ([9]) is no 
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longer around z = and the mass peak disappears. We shall come back to this point later 
in Section l42l 

Note also that, from Fig. [H one can have a feeling of what the optimal radius will be for 
parton- level events. Indeed, one can expect that, for R = Rhest, the spectrum will be nearly 
symmetric around the nominal mass [i.e. around z = 0), that is i^'j(i?best) = -^^/(-Rbest)- For 
both quarks and gluons and for all masses, this means that one should expect -Rbest ~ l-l- 
We will see later on that this is indeed relatively close to what we shall obtain but that 
PDF effects, computed in the next Section, induce some departure from that situation. 



3.2 PDF effects 

In order to get a better description of the variety of processes we consider over the whole 
kinematic range, we shall see that it is important to take into account the effect of the 
PDF on initial-state radiation. This addresses the fact that the emission of an initial-state 
gluon imposes to take the parton distribution functions at a larger momentum fraction x. 
This should therefore introduce an additional suppression, more important at large mass 
and for gluon-jets than for quark jets. 

Let us consider the production of a heavy particle at a fixed rapidity y. The leading- 
order cross section is easily obtained: 

<'!:°{y) = J dxidx2Mx,)Ux2)HxiX2S-Af)s(^\og(^^J-yj»{M,y) 

= '''' 

where the index a represents the colour of the colliding partons, fa{x) is the PDF implicitly 
taken at a scale M, and s is the centre-of-mass energy of the collision. This expression 
also comes with the kinematic constraint \y\ < log(M/-y/i). 

We should now add to that process an extra parton emitted at a rapidity rj. For our 
purpose of computing the initial-state radiation contamination to a jet, we can safely make 
the simplifying assumption that the rapidity of the gluon is the same as the rapidity of 
the "leading" parton, i.e., r] = y. By doing so, we decouple the effects of the PDF from 
the effects of the jet clustering. The latter will be reinserted later on by multiplying our 
results by the geometric factor Ai{R). We thus have 

^a''''(y) = ^ I dx,dX2dUaix,)Ux2)^5ix,X2^S-M')5(^^\0g 

In that expression, ^ is the longitudinal fraction of Xi carried by the parton entering the 
collision satisfyingj 1 — ^ ^ 1. If we rewrite it in terms of the longitudinal momentum of 
the emitted parton (measured w.r.t. the momentum of the beam) ^ = (1 — ^)xi, one gets 

^We only keep the dominant soft-gluon emission in the sphtting function. 
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after a bit of algebra 



Keeping the kinematic conventions introduced in the previous section, the longitudinal 
fraction ( can be rewritten in terms of the transverse momentum = zM/2 ^ 6m of the 
parton as 



e 



y 



C = ^6m, (14) 



which finally implies 



da^^^o, , 1 , f{M + 6m)ey\ , fMe~y\ 2C 



d6m s 27r2 y ^/s J \ ^/s J 6m 

with the kinematic constraint 

-log(^)<.<log(^)-log(l + -j (16) 

In the limit of small 5m, i.e. neglecting the 6m offset in the PDF, one gets 

da„^^o. . a^Cn 1 



d5m TT^ 6m 

which is exactly the result we have obtained in the soft limit up to the geometric factoi0 
lAiiK) that we will need to reintroduce at the end of the computation. 
To simplify the discussion, we shall assume that the PDFs take the form 

fa{x) = NaX-^-\\-xf'^. (17) 

The coefficients Na and will depend on the type of parton and the scale M but we shall 
assume a unique value for A. 

At this level, we could proceed by integrating over the rapidity y. This is feasible 
analytically with the choice of PDF f|T71) but it leads to a Gauss hypergeometric function 



in dPf^^^ l(16m for which the Sudakov factor ([8]) cannot be computed analytically. We will 
therefore carry on with the computation at a fixed rapidity y. 

Taking into account the emissions from both partons entering the collisions, normalising 
to the LO cross-section and reinserting the geometric factor, one obtains 



dP/°) asCa , 1 



-r 



-A-l 



+ 



ney I \ 1 — ne y 



d6m TT^ ^ 6m 

^This corresponds to the integration over the geometrical region in y and where the gluon is emitted 
in one of the two jets. 
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where we have introduced k = M/ y/s and r = 1 + 5m/ M, and it is understood that the 
terms in the squared brackets are restricted to the appropriate phase space i.e. /tre^^ < 1, 
respectively for each term. 

In order to compute the Sudakov form factor analytically, we shall make the approxi- 
mation that the exponents [3a are integers (in practice, we shall use /3g = 3 and /3g = 5 for 
(anti-)quark and gluon-jets respectively), and expand the small-x power behaviour to first 
order in 5m: 

(19) 



Using 



1 - KTe'^y 



1 - 



M 

Ke^y 5m 



(20) 



1 - ne^y ^ 1 - ne^y M 

the fact that is an integer allows one to see f|T8|) as a polynomial in 5m. 

We can then easily compute the Sudakov factor using the following fundamental inte- 
grals: 



du 



M/2 

Sm log(M/Ai 



p^l p=0 , 

= log 



log(M/(2A,)) 
\og{5m/Ai) 



(21) 



{Ei [p log(M/(2A,))] - Ei [p log((5m/A,)]} , (22) 



where Ei(x) is the exponential integral. 

With these simplifying assumptions, the final perturbative spectrum for initial-state 
radiation can be written as 



d5m log(5m/Aj) 
with the integrated probability 

log(5m/Aj 



fc=0 



5m 



k-l 



P,(Ai,5m) 



(23) 



P,{Ai,5m) 



log(M/(2A,)), 



exp 



k=i ^ ^ . 



and the coefficients 

l^k = z/fe - (1 + A)z/fc_i, 



^ r^k 



e 



n—e y 5m 



+ 



K — e y 

Ei [k log(M/(2Ai))] - Ei [k log((5m/A,)] . 



K 



K-ey 



e 



K — e^ 5m 



(24) 

(25) 

(26) 

(27) 
(28) 



Note that to obtain fl25]) . we have to integrate 5m up to M/2, which is strictly valid only 
when \y\ < log(K/2), but similar results can easily be derived at forward rapidities. 
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Figure 2: Reconstructed mass spectrum from our perturbative-QCD computation com- 
pared with Herwig at parton level (cyan curve). For the analytic computation, we show 
both the soft approximation (red curve) and the full case including PDF effects (blue 
curve). The left plot is for a qq dijet system at a nominal mass of 100 GeV while the right 
plot is for the gluonic case at 2 TeV. In both cases, the anti-Zcj algorithm with a radius of 
1 has been used for the clustering. 

3.3 Comparison with Monte Carlo 

We want to conclude this Section by a comparison of the predictions we obtain for the 
optimal radius from our analytical studies and from Monte-Carlo approaches. When deal- 
ing with the analytic result, we shall use the initial and final-state results computed in the 
previous sections in the two different situations for which we have analytic results: the soft 
limit with a running-coupling approximation, eqs. (|9]) and (fTTj) . and the "full" spectrum 
where we also include the PDF effects in the initial-state radiation, eq. (l23!l . 

The total perturbative QCD spectrum is the convolution of the initial and final-state 
pieces which, in practice, will be carried numerically. We shall use Aqcd = 200 MeV and 
Nf = 5 to compute the running of the QCD coupling, and, for the PDF effects in the 
initial-state-radiation spectrum, we shall adopt A = 0.3, /3g = 3 and Pg = 5, and assume 
y = 0, where we have checked that most of the massive objects in the Monte Carlo sample 
were produced. The quality measure is computed from the spectrum using ([1]). 

We start our comparison directly with the spectrum for the reconstructed dijet mass, 
as shown on Fig. |2] for quark jets, M = 100 GeV and gluon jets, M = 2 TeV. Generally 
speaking, the agreement between our analytic computation and Herwig is good, even very 
good for the 100 GeV quark-jet case. Note that this agreement is always better in the 
region of the peak, where the soft-gluon emissions approximation we are working with is 
supposed to hold. Similarly the small differences in the tail at small and large 
beyond the scope of our approximation. In both the quark and gluon-jet cases, we see that 
the inclusion of PDF effects, reducing the initial-state radiation, significantly improves 
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Figure 3: Quality measure as a function of R for the parton-level spectrum. The (red) 
dashed curve corresponds to our analytic computation in the soft limit while the (blue) 
solid one includes PDF effects. The (green) triangles and (cyan) circles show the Monte- 
Carlo simulations for Pythia and Herwig respectively. As for Fig. [21 the left plot is for a 
qq dijet system at a nominal mass of 100 GeV while the right plot is for the gluonic case 
with M = 2 TeV. 



the description, especially in the case of large-mass gluon jets, where we are sensitive to 
fast-falling gluon distributions at large x. 

Next, we turn to the computation of the quality measure, presented in Fig. [31 After 
properly taking into account the PDF effects, our computation agrees in shape with the 
Monte-Carlo results. For the normalisation, we are closer to the Herwig simulation in the 
case of quark jets, while we better agree with Pythia simulations for gluon jets. Since our 
analytic computation is identical in both cases, it it somehow hard to find an explanation 
for those differences. 

Finally, let us concentrate on the values of -Rbest extracted from the previous examples. 
This is shown on Fig. [Hfor quark jets (left panel) and gluon jets (right panel). Despite the 
apparent normalisation issue between Pythia and Herwig observed on Fig. [3l the fact that 
they agree on the shape of the quality means that they yield very similar extracted optimal 
radii. If we turn to the analytic computation, we first see that in the soft approximation, 
the optimal radius is independent on the process under consideration with -Rbest ~ 0.96 
regardless of the mass or the parton typ^. The reduction of the initial-state radiation due 
to the PDF effects has a sizeable effect on the extracted optimal radius in such a way that, 
at the end of the day, our analytic extraction of Rhest in the "full" model reproduces very 
well the Monte-Carlo simulations with an error < 0.1. 

^This is a bit smaller than the value around 1.1 expected from our discussion related to Fig. [TJ In 
practice, this is due to the fact that the quality measure is a complicated function of Aij[R) involving 
e.g. their derivatives which are different at R— 1.1. 
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Figure 4: Best radius as a function of the mass, obtained from our analytic computation 
and compared to Monte-Carlo results. The analytic curves correspond to the soft limit 
(dotted red curve), and full computation (solid blue curve). These are compared to values 
of -Rbest extracted from parton-level Pythia (green triangles) and Herwig (cyan circles) 
simulations. The left plot shows the case of a resonance decaying into qq jets while the 
right plot has gluon jets. 



4 Effects of the Underlying Event 
4.1 Description of the Underlying Event 

One of the most important effects when clustering jets at hadronic colliders is the presence 
of the underlying event (UE) coming from multiple interaction and beam remnants. This 
tends to produce a fairly uniform soft radiation of a few GeV per unit area that contam- 
inates the jets and affects their momentum reconstruction. In this Section, we therefore 
study the effects of this UE on the mass spectrum we have computed from perturbative 
QCD in the previous Section. 

If we work within the assumption that the UE is uniform in rapidity and azimuth, the 
average pt contamination to each jet will be ^jetP where ^jet is the (active) area of the jet 
[22] and p the UE density per unit area. For a given event, this contamination is affected 
by two kinds of fluctuations: first, the fluctuations of the background itself which go like 
jctO", with cr the background fluctuations per unit area. Then, keeping in mind the 
possibility to discuss our results for different jet algorithms than the anti-fcf algorithm, the 
active area of a jet is in general known to fluctuate, with an average value proportional to 
7ri?2 that increases with in a different way for each jet algorithm. 

For a given background density p, let us denote by dPj/dpt^fc the probability distribution 
for a jet k to receive a contamination "pt^k from the UE. For a jet of area ^jct = vri?^ a^, this 
distribution has an average of vri?^ a^p and a dispersion ^/tiB? akcr. We therefore also have 
to introduce dPa/da^ the distribution, of average a and dispersion cja of the area divided 
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by TcR"^, of the jet k. 

If the two jets were perfect circles of radius R, one could easily compute that the 
offset on the reconstructed mass would 6m = 27iRIi{R){pi + P2), up to corrections 
proportional to 6m'^/M, where Ii{x) is the modified Bessel function of the first kind. For 
generic jets we shall then use 



Sm 



2h{R) 
R 



{Pt,l + Pt,2) 



(29) 



where pt^i and pt^2 are the transverse momenta of the background contamination of our 
dijets. 

On top of that, when we consider an event sample, the values of p and a will fluctuate 
from one event to another. For the event-by-event fluctuations of the background densities, 
we shall assume a distribution dPg/dp, of average (p) and dispersion a^. For the intra-event 
fluctuations (per unit area) a we shall simply assume that they remain proportional to 
p, i.e., p = a/p = (a/p). 

Given the event-by-event p distribution dPg/dp, the intra-event fluctuations, dPi/dpt^k 
and the area distribution dPa/dak, the UE spectrum can be inferred from f l2^ 



dP, 



UE 



d6m 



dp 



dPe 
dp 



daida2 



di^di^ 
dai da2 



, , dPi dPi ^ 

dpt,idpt,2-, -, 

dpt,i dpt,2 



2/1 (P), 
^(Ri +R2, 



(30) 

Without knowing exactly that distribution, we can compute its average and dispersion 
in terms of the properties of the fundamental distributions: 



{6m) 



2 



d5m Sm 
2/1 (P) 



dp 



UE 



P 



d6m 
27rR^a (p) 



dSm {Sm — {Sm))' 



dP 



UE 



dSm 



(31) 



(32) 



2/1 (P) 
P 



[4a^(rf)VJ + 2{nRr ((p)^ + aj) al + 2a(rf) ((p)^ + aj) p^] . 



The average is consistent with what one would naively expect, and the dispersion is a 
combination of the three separate sources of dispersion we are including: event-by-event 
fluctuations in p (the first term), area fluctuations (the second term) and intra-event fluc- 
tuations (the third term). As expected, the first two are proportional to the area of the 
jet i.e. to vrP^, while the intra-event fluctuations are proportional to its square root. 



^The Bessel function mostly come from the fact that particles emitted far from the jet axis will con- 
tribute more to its energy. 

^"in terms of the toy model for the underlying event introduced in 23], this corresponds to a fixed 
density of particles with a varying mean pt . 
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So, instead of modelling the various distributions separately and perform the integra- 
tions in (130!) . we shall directly model the total UE spectrum and adjust its parameters to 
reproduce the total average and dispersion. More specifically, we shall assume 



dPuE _ 1 
dSm T(a) 6mQ 



g-5m/5mo_ (33) 



The parameters 6mo and a can be adjusted to reproduce the average fl?Il) and dispersion 
of the UE distribution. One finds 

« = ^^ and 5mo = -^. (34) 
This choice has the advantage over a Gaussian distribution that it remains positive. 



4.2 Computation and comparison with Monte Carlo 

Our strategy is rather straightforward: as we did when combining the initial-state and 
final-state radiation spectra, we shall convolute the perturbative QCD spectrum, discussed 
in Section 13.31 with the UE spectrum ( l33|) . For each mass and parton-type, we wish to 
compare our results with the Monte-Carlo studies [4J. 

For the perturbative QCD part of the spectrum, we use the convolution already dis- 
cussed in Section 13.31 and consider the same models: the pure soft approximation and the 
spectrum including PDF effects in the initial-state. The result is itself convoluted with the 
underlying-event distribution for which we still have to determine the parameters appear- 
ing in eqs. ( 13T1) and ( l32l) . The area properties a and cxa have been calculated, together 
with their scaling violations in [22j. The properties of the background are computed di- 
rectly from the Monte-Carlo event sample using FastJet [21] and the method advocated 
in [25] and [23]: each event is first clustered with the Cambridge/ Aachen algorithm with 
R = 0.6; p and a are then estimated from the jets that are within a rapidity strip of 1 unit 
around the hardest dijet system, excluding the two hardest jets in the event. The average 
properties (p), fi and can then be easily extracted from the complete event sample. For 
completeness, we give a table of the values we have obtained in Appendix \M 

With that method, we are able to compute the mass spectrum, and thus the quality 
measure, at a given mass and with a given parton type and UE characteristics. We then 
extract -Rbest by searching the minimum of the quality measure by steps of 0.01 in R. 

As for the parton-level case, we shall compare our computations with the Monte Carlo 
simulations from Pythia (tune DWT) and Herwig (with default Jimmy tune) as the details 
of the Underlying Event can differ. We shall again proceed in three steps: first consider 
the histograms, then the quality measures and finally the optimal radius. For simplicity, 
we shall focus on the anti-fcj algorithm for which a = 1 and cx^ = 0. 

Let us start with the histograms. Figure where we illustrate our description of the 
spectrum for the quark dijets at small mass and gluon dijets at large mass. In both cases, 
our analytic histogram correctly reproduces the behaviour observed with Herwig. We note 
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Figure 5: Reconstructed mass distribution for quark dijets at 100 GeV (left panel) and 
gluon dijets at 2 TeV (right panel). We compare the Monte-Carlo simulations from Herwig 
with our analytic computation. In both cases, we have used R = 0.7 and the anti-kt 
algorithm. 




Figure 6: Quality measures as a function of R for quark dijets at 100 GeV (left panel) and 
gluon dijets at 2 TeV (right panel). The triangles and solid lines correspond to the Pythia 
simulations while the circles and dashed lines correspond to Herwig simulations. 
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Figure 7: Best radius as a function of the mass when including the UE. Our analytic 
results are compared with Pythia Monte-Carlo simulations. The left (right) plot is for 
quark (gluon) jets. 
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Figure 8: Same as Fig. [3 but for Herwig simulations. 
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however that our analytic computation shghtly underestimating the UE in the peak region 
in the case of quark jets at small nominal mas^ M. If we then move to the case of the 
quality measure, Figure |6l we see that, given the differences we have already observed at 
the parton level (see Figure |3]), the addition of the underlying event gives a reasonable 
description of the Monte-Carlo results, especially the shape of the quality at large R. Note 
that only the curves corresponding to the model including the PDF effects are shown on 
Figure |6] for clarity reasons. 

Finally, the extracted optimal radius is presented on Figures [7] and [8] for Pythia and 
Herwig simulations respectively, as a function of the mass for both quark and gluon jets. 
Generally speaking we obtain a good description of the Monte-Carlo results. We see once 
again that the inclusion of the PDF effects significantly improves the description of the 
optimal radius for gluon jets at large mass, as expected. We slightly underestimate the 
optimal radius in the case of gluon jets at large mass, but we have to notice that this is the 
regime on which it matters the least as in that region the minimum in the quality measure 
is rather flat. 

We thus see that our complete analytic model, the sohd (blue) curve on Figures [7] 
andlH corresponding to our analytic pQCD computation including PDF effects, eq. f l23|) . 
convoluted with our model of the Underlying Event, eq. f p3|l . gives a good extraction of 
the optimal radius -Rbest- 

4.3 Computation for other algorithms 

Now that we have checked the agreement between our analytic computations and Monte- 
Carlo simulations for the simpler case of the anti-fcj algorithm, let us briefly discuss other 
possible jet algorithms. In this section, we shall apply our analytic computations to the 
case of the Cambridge/Aachen (C/A) as well as to the Cambridge/Aachen algorithm sup- 
plemented by a filter (C/A+filt). While the former usually leads to quality measurements 
close to what is obtained in the anti-/ct case, the latter produces narrower peaks with an 
optimal radius slightly larger than the one obtained from C/A without filtering. For sim- 
plicity, we shall follow what has been done in p[] and use i^mt = R/2 and ngit = 2 for the 
parameters of the filter, i.e., recluster each jet with C/A and a radius of half the original 
one and keep the two hardest subjets, discarding the softer ones. 

The analytic computation goes exactly as in the case of the anti-/cj algorithm. The 
only thing that will change in our approach is the inclusion of the area contribution in the 
UE distribution. The expressions for the area are computed analytically, as in [22], up to 

^^In the case of Pythia simulations, this underestimation, still observable in the case of quark jets at 
small mass M, is a bit more pronounced. 
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Figure 9: Best radius as a function of the mass for different algorithms. The hues are 
obtained using our analytic approach while the points correspond to Pythia simulations. 
The left plot shows the case of quark dijets and gluons are shown on the left panel. The 
solid (blue) hues together with the squares correspond to the anti-Zc^ results, the dashed 
(green) lines and the circles show C/A results, and the dotted (red) lines and the triangles 
correspond to C/A with filtering. 



order a,: 



^O.JD + d. 



^0,JD 



JD 



(3qtc 



log 
log 



Qs(QojD) 

as{Rpt) 



as{Rpt 



(35) 
(36) 



where the coefficient Aq, d, Eq and s are computed in perturbation theory and have been 
calculateclll in [22], while the scales Qq are of non-perturbative origin and have been 
determined in [22] from a fit to Herwig simulations. In our case, we shall use pt = M/2 
together with the following values for the coefficient^ : 



Ao,c/A = 0.814, dc/A = 0.083, S^,c/a = 0.0687, = 0.036, (37) 

^o,c/A+mt = 0.386, rfc/A+mt = 0.019, S^c/A+mt = 0.0046, s^/A+mt = 0.004, (38) 

^o'!c/A = ^o'!c/A+fiit = 0.26 GeV, Qo%/a = *5o^c/A+fiit = 0.53 GeV. (39) 

To avoid multiplying the figures, we just show, see Figure [HI the final result of the 
extracted optimal radius as a function of the mass for the three algorithms under con- 
sideration, compared to Pythia simulations. We see that, in agreement with the Monte 



^^For the h, C/A and SISCone algorithms. 

-"^^The coefficients for the filtered case have been computed with the same method as in ^22, and we have 



simply assumed the same non-perturbative scales as for the C/A algorithm. 
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Carlo, the differences between anti-Zcj and Cambridge/ Aachen are very small and thus our 
analytic approach reproduces the expected behaviour correctly. For the case of the C/A 
algorithm with filtering, we obtain a good description in the case of quark dijet systems but 
systematically underestimate the optimal radius, by about 0.1-0.2, in the case of gluon jets. 
We believe that this is due to the fact that we should also introduce some dependence on 
the algorithm at the level of the perturbative spectrum. In our soft-gluon approximation 
all the algorithm will have the same initial and final-state radiation spectra. However, with 
a more complete treatment, e.g. including non- global logarithms as in [19], differences will 
start appearing. Another effect comes from the combination of the initial and final-state 
radiation spectra: the simple convolution we have used is reasonable in the case of the 
anti-A;^ algorithm where the shape of the jet is not affected by the radiation, but more 
subtle phenomena will appear for more complex algorithms, especially in the filtered case 
where, in some geometric regions, the capture of ISR gluons can be affected by internal 
gluon radiation in the jet. 

All these effects are beyond the reach of the present analysis. As expected from the 
previous discussion, discrepancies are more pronounced in the case of gluon jets at large 
mass, where perturbative radiation dominated the spectrum. Our computation neverthe- 
less gives reasonable results at the end of the day, given also that for gluon dijets at larger 
mass, the minimum in the quality measure is less pronounced than in other situations. 

Finally, let us mention that the case of the kt algorithm [21] is also very close to what 
we obtain for the anti-fct and C/A algorithm, hence well reproduced by our calculation. 
For the SISCone algorithm, differences in the perturbative spectrum also appear when the 
radiated gluon is not soft, we therefore postpone that case for a further study. 



4.4 Comparison with earlier approaches 

To some extent, the present calculation can be seen as an extended version of what has 
been done in [13j, where the authors compute the average pt shift of a jet due to final-state 
radiation, hadronisation and the UE. The approach we pursue in this paper is however 
much more complete in the sense that we obtain the full spectrum instead of just the 
average valuS As a consequence, we can extract the width of the peak really as the 
dispersion in the spectrum, while in [13] the width is approximated by 

{5pl) ^ {Sp,)l^, + (5p,)Ldr + (^P*)uE (40) 

i.e. the total dispersion is approximated by the sum of each independent average squared. 
The individual contributions are found to be [T3] 

cy 

{^Pt)pevt = —iog{R)LaPt, 

—9C 

{SPtK,r = -^Af^l), (41) 

^^Also, we did not include hadronisation effects which are not important in the case we are concerned 
with, and we include initial-state radiation effects which are important to get the partonic picture right. 
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Figure 10: Best radius as a function of the mass for our approach compared to the previous 
work of [13]. The sohd (blue) hne corresponds to our present computation in this paper 
while the dotted (red) line shows the result obtained in Ref. [13]. The Pythia results, the 
(green) triangles, are shown for comparison. The left (right) panel shows the case of quark 
(gluon) jets. 



with 2CFA{fii) = 0.5 GeV for fij = 2 GeV and 

(42) 



L, = (21og(2)-|)Cp, 



L, = (21og(2)-l^)c^ + liV,T«. 



The optimal radius is then obtained by minimising fHU]) w.r.t. R. 

In practice, we have used pt = M/2, Nf = 5, Aqcd = 200 MeV and computed the QCD 
coupling at the scale Rpt- Instead of using a fixed value for Aue as in [13], we have used 
the fact that it is related to the average background density, (p) , through (see Section 14. ip 

AuE = 27ra(p)||||, (43) 

and used the values extracted from the Monte Carlo event samples and already used in 
Section Wl] for (p). 

The extracted value of -Rbest is shown on Fig. [TU] in the case of Pythia simulations, for 
the anti-fct algorithm. The results are compared with the Monte Carlo results and with our 
complete analytic calculation. We see that the simpler approach of [13] manages to extract 
the correct value for -Rbest at small mass but fails to reproduce the observed behaviour at 
larger mass, and this for both quark and gluon jets. The reason for this failure is rather hard 
to pinpoint: it can come, e.g., from the crudeness of the approximation fHUl) . the lack of an 
initial-state radiation contribution and the corresponding PDF effects, or the description 
of the underlying event in terms of the average density (p) instead of its fluctuations ap. 



22 



5 Background subtraction 
5.1 Description 

The last point we wish to discuss is the situation where we perform background subtraction 
using jet areas [22], as advocated in [25]. The idea behind UE subtraction is to get rid of 
the shift due to the UE density contaminating the mass reconstruction. Since subtraction 
is performed on each single event, one hopes to get rid of the fluctuations of the background 
density across the events, hence obtaining a narrower peak. In this respect, all we should 
be left with is the background fluctuations inside an event which have a linear dependence 
on R rather than the quadratic one computed in the unsubtracted case ( 132|) . This is a 
much smaller effect and one thus expect the quality measure to be better {i.e. smaller) 
than in the unsubtracted well as the optimal radius to be larger. 

In practice, one would thus naively convolute the perturbative spectrum computed in 
Section |3] with a subtracted UE spectrum taking into account these intra-event fluctuations. 
Doing that would actually lead to a quality much better than what is observed in Monte- 
Carlo studies and to a systematic over-estimation of -Rbest- 

The additional effect that is mandatory to take into account is the fact that subtraction 
is performed using an estimated value for the background density pest which can differ from 
the exact value of p for that event. This results in some smearing effect left over from the 
unsubtracted spectrum. Because of this, the computation of the subtracted UE spectrum 
turns out to be more involved than the corresponding unsubtracted case computed in 
Section 14. mostly because one also would have to consider the distribution probability 

for Pest- 

The properties of the estimated value of p have actually recently been discussed at 
length in [23], where we learn that the estimation of the background density using a median- 
based approach suffers from two sources of bias: one of soft origin, related to the fact that 
the median of a set of pure-UE jets may differ from its average; and a contamination from 
radiation coming from the hard jets. If we call p and a the average UE density and its 
fluctuations for one given event, and define 6 = (pest — p)/p as the relative error in the 
estimation of p, the average and dispersion of 6 are [23] 

{5) ~ dip — d2p^ and a1 ~ S^p^ 

where, as in Section WA\ we used p = oj p, and the coefficients ar^ 

^^For the hard contamination, we assume, as we shah do in the next Section, that the "bare" jets have 
been excluded from the range. 
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(mm(Qo, ^cjaRp)) 



(47) 
(48) 



In the previous set of equations, cj ~ 2.04 is a pure number extracted in [23], Rp is the jet 
radius used to estimate the background properties, A^ot is the total area of the range we 
are using to estimate the background, Qo is a soft cut-off that we fixed to 1 GeV, and Q 
is the hard scale that we naturally took equal to M/2. 

These properties constrain the distribution of 5, that we shall denote by dP^/d^, as a 
function of /i. Since the intra-event fluctuations will play a substantial role in this UE- 
subtracted situation, we shall also consider a distribution for them. For simplicity, we 
shall parametrise the distribution in terms of rather than in terms of a as background 
estimations are naturally expressed in that variable. We therefore introduce dP^/d/i, of 
average (/x) and dispersion cr^. In the course of the computation below, we shall encounter 
higher cumulants for this distribution. To keep things simple, we shall work with the 

The 



Gaussian assumptioio z.e. {n^) = (/i) ((//) + 3cr^), (/i^) = (/i) + 6 (/i)" 
values of (/i) and can again be extracted from the Monte-Carlo samples. 

With this at hand, we proceed, as in Section 14. by computing the mass shift for a 
given configuration and deducing its average and dispersion. For given values of p, /i. 
Pest, jet areas and jet UE contaminations pt^k, the offset in the reconstructed mass after 
subtraction is 

~2h{Ry 



6m 



R 



[Pt,i +Pt,2 - TcR'^{ai + a2)(l + 6)p\ 



(49) 



where the last term is the subtracted piece. 

After a slightly lengthy computation we reac^^''' 

'2h{R) 



(Sm) 



J2 
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2h{R) 
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27rp2a (p) [-di (p) + c?2((p)' + al)] 



(50) 



({ [27rR'-a + 2{7:R'fal{dl + Sl) + A^T^R'afS'^ {{p)' + 

+4(rfa)2rf?aJ}((p)VaJ) 
+ [2{^R^fal{{pf + al) + 4(vrP^a) [dl (p^> - 2d,d, (p^» 
+ 4(rfa)2 {pf [2dlal + {2d, (p) - d,f] al) . (51) 

Once we have these expressions, the last step is to parametrise the subtracted UE 
spectrum. Since subtraction can produce negative values for Sm, we cannot use ( l33i) . 



^^Alternatively, one can use the moments obtained from a Beta distribution which would enforce the 
positivity of /Lt. We have checked that this choice does not affect the resulting i?bcst- 

^^The computation proceeds as in Section [4. II The only additional approximation is the replacement of 
a by (/i) (p) in {rih) (eq. (1481) ). which only introduces subleading corrections. 
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The technique we shall adopt is to keep the same distribution using the unsubtracted 
average together with the dispersion we have just computed, and then to shift the complete 
distribution in order to reproduce the subtracted average 5m. In practice, this gives 



dP, 



UE ,sub 



Via) 



{6m + 5mi)" ^ 
6mn 



- (Sm+Smi ) / Smo 



(52) 



where the parameters 6mo, 6mi and a are given by 
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6mi = {6m) 



unsub 



{6m) . (53) 



unsub 



with {6m) and aj^ given by eqs. ( 150|) and ( 15T|) . and ((5"^)^^^^^^ by eq. (13T|) . 

Finally, it is interesting to notice that in the case of a perfect estimation of the back- 
ground density pest = P, i-e. di = d2 = Sd = 0, one recovers 



{6m) 



0, 
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5m 



2h{R) 
R 



2rfa((p)^ + aj)((/i) 



a, 



(54) 
(55) 



i. e. no average shift and a (squared) dispersion that is proportional to the area (linearly) 
and to (T^. 



5.2 Comparison with Monte- Carlo simulations 

As for the previous cases we have analysed, we shall perform a comparison of our analytic 
predictions with Monte Carlo simulations for the UE-subtracted case. The method goes 
exactly as before, so we just highlight the main steps and present our results for the optimal 
radius -Rbest and stay with the anti-/ct algorithm. 

For the analytic part of the computation, the perturbative QCD spectrum is convoluted 
with the subtracted UE spectrum calculated in the previous Section. Concerning the 
parameters of the latter, all background properties, (p), o"^, (p) and o"^ are extracted 
directly from the Monte-Carlo data sample and the estimation of the background density 
was performed using the Cambridge/ Aachen algorithm with a radius of 0.6. We have kept 
all the jets within one unit of rapidity around the hardest dijet, excluding the two hardest 
jets in the event, giving a total area A^ot = 47r. It is interesting to notice that when 
background subtraction is performed, the final results will slightly depend on the details 
of how p is estimated, through the coefficients di, d2 and Sd in our analytic approach. 
Extending the range, i.e. increasing its area Atot, would decrease the bias in the estimation 
of p, but in that case, there would be a risk that the background would not be uniform over 
the range [e.g. vary with rapidity). Following [23], we could therefore also try to optimise 
the choice of range and jet definition used for estimating p. 

The computed value of -Rbest is shown on Fig. [TT] in the case of Pythia simulations. 
For an easier comparison, we show both the subtracted and unsubtracted results. For the 
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Figure 11: Best radius as a function of the mass for both the UE-unsubtracted and the 
UE-subtracted cases. The sohd (blue) hne corresponds to our subtracted prediction and 
has to be compared to the Pythia Monte-Carlo simulation shown as (green) circles. For 
reference, we also show as a (red) dotted line the unsubtracted predictions, together with 
the Pythia results shown as the (cyan) triangles. The left (right) panel shows the case of 
quark (gluon) jets. 

analytic calculation, we observe a value of -Rbest larger in the subtracted case by about 0.04 
in the case of quark-jets and by about 0.08 in the case of gluon-jets. All these values are 
in good agreement with what is observed in the Monte-Carlo simulations. 

6 Conclusions and perspectives 

Throughout this paper we have investigated analytically the optimisation of the radius R 
one uses with a jet algorithm to reconstruct dijet resonances. We have already learnt [1] 
that carefully choosing the jet radius can lead to significant improvement of the recon- 
structed signal and we find it valuable to supplement previous Monte- Carlo studies with 
an analytic understanding of these dijet reconstructions. 

Our approach has been to calculate the reconstructed dijet mass spectrum. This was 
done by convoluting fundamental pieces: the perturbative QCD radiation in the initial 
and final state, and the contamination due to the Underlying Event. For the perturbative 
part, we have worked in the soft-gluon emission approximation, a choice motivated by the 
fact that we are mostly interested in the behaviour in the vicinity of the mass peak. In 
the case of the initial-state radiation spectrum, we have also included PDF effects that can 
have a non-negligible impact, especially in the case of gluon jets and at large dijet mass. 
For the UE description, we have modelled it in such a way as to reproduce the observed 
properties of the background. From the reconstructed mass spectrum, we can compute a 
quality measure and extract an optimal radius, following the method outlined in [1]. 
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Our analytic results have been considered and compared with Monte-Carlo simulations 
in various cases. We have started with parton-level events, where we can test our analytic 
computation of the perturbative spectrum; then we have focused on full hadronic events 
including the Underlying Event; and, finally, we have considered the situation with a jet- 
area-based background subtraction [25J. We have mostly considered the case of the anti-kt 
algorithm but have also considered other options. 

Generally speaking, we can say that we obtain a good description of the Monte-Carlo 
results, especially when including the PDF effects in initial-state radiation. The optimal 
radius -Rbest computed analytically is in most cases less than 0.1 away from the Monte- 
Carlo expectation. The comparison at a more basic level, the quality measure or even the 
mass spectrum, also shows a good agreement. 

Most of the deviations between our analytic results and Monte Carlo simulations can 
probably be traced back to what we observe at the partonic level. Though our analytic 
computation reproduces nicely the Monte-Carlo simulations, we have observed some devi- 
ations in Section 13.31 It is interesting to notice that the differences between our description 
and the Monte-Carlo results is usually of the same order as the differences obtained be- 
tween Pythia and Herwig. These differences propagate to the situation where the UE is 
included. 

The one case where our analytic results deviate a bit more from the Monte-Carlo 
simulations is the case of the Cambridge/Aachen algorithm supplemented by a filter (see 
Section 14. 3p . for events including the UE, while our description remains good for the 
anti-kt and the Cambridge/Aachen algorithm without any filter. We think that a better 
description goes beyond the reach of the present analysis, e.g. already at the partonic 
level, a simple convolution between the initial and final-state spectra may be too simplistic 
when filtering is applied. 

Let us also notice that our description of the UE allows us to treat successfully both the 
case where the UE is subtracted and the case where it is not. Our approach includes all 
the effects that contribute to the dispersion of the reconstructed mass: mostly background 
fluctuations from one event to another, intra-event fluctuations and area fluctuations. It is 
interesting to notice that in the case of subtracted UE, it was also mandatory to take into 
account the fact that the estimated background can differ from its "true" value. This leads 
to a significant additional contribution to the dispersion. Using the analytic computations 
from [23] led to good results for the extracted -Rbest- 

If one wishes to improve the present description, additional effects have to be included 
in the analytic computation: relaxing the soft-gluon-emission approximation, obtaining 
sub-leading logarithmic corrections, performing a better resummation of the soft virtual 
corrections than a simple Sudakov factor, e.g. using non-global logarithms, and including 
hadronisation corrections. However, we have seen that the description we can achieve with 
our simple approach is already satisfactory without having to introduce these corrections. 
It is nevertheless important to remember their existence as they might become relevant in 
more specific situations, like the case of filtering where we have observed that we start to 
reach the limits of our simple approach. Some effects, like subleading logarithmic correc- 
tions, may even be simpler to compute analytically than to implement in a Monte Carlo. 
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Another point that we have not investigated and that might be interesting in a future 
study is the case of the rapidity dependence of -Rbest- As we have seen in Section I3l2| PDF 
effects introduce a dependence on the rapidity at which the heavy object is produced [6] 
and it might be interesting to see how -Rbest varies with that rapidity. 

Also, all the computations we have performed were for a complete event-set. A next 
major step would be to optimise the radius on an event-by-event basis. This is much more 
complicated as the previous approach would then only give access to an average dispersion 
for a single event and the additional dispersion arising when combining the whole set of 
events would be much harder to handle. We therefore also leave that step to future studies. 

The question of knowing which jet algorithm should be preferred has already triggered 
many discussions both on the theoretical and on the experimental side, and more complex 
jet definitions introducing more parameters have often been proposed to improve jet re- 
construction. The present work, together with [19] , can be seen as opening a new direction 
of improvement: analytically optimising the parameters in the jet definitions. Considering 
a single parameter, the jet radius, for a simple process, dijet reconstructions, is a first 
step in that direction. Gaining analytic control on the parameters entering jet definitions 
can be extremely valuable. This is especially true when new parameters are introduced: 
being able to predict them can avoid often costly determinations based on Monte-Carlo 
simulations or rough empirical choices. The application of the techniques developed in this 
paper to other parameters and more general processes would therefore be very interesting. 
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A Extracted background properties 

For the sake of completeness, we briefly mention in this appendix the values of the back- 
ground properties that are necessary to fix the UE spectrum. These will differ between 
quark and gluon jets and between Pythia 6.4 (tune DWT) and Herwig 6.51 (with default 
Jimmy tune). All the values used in our analytic computation are summarised in Table [TJ 
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